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Semantic  Search 


Don't  just  search  on  keywords,  but  on  a  model  of 
what  the  words  mean. 

Meaning  is  still  very  difficult  to  ascertain,  but  we 
can  estimate  it  the  co-occurrence  of  other  words 


Correlations  can  be  discovered  by  probabilistic 
topic  models  like  Latent  Dirichlet  Allocation  or 
Hierarchical  Dirichlet  Processes. 


This  allows  us  to  find  documents  that  are  relevant 
to  the  query,  even  if  they  do  not  share  keywords 
with  the  query. 
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Example  Wikipedia  “Topics,” 
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Our  Experiments 


•  We  tested  on  a  standard  benchmark  data  set  from 
the  TREC  challenge. 

•  TREC  (Text  REtrieval  Conference)  is  a  yearly 
competition  sponsored  by  NIST.  They  provide 
standardized  datasets  for  information-retrieval 
research 


•  We  used  the  Cranfield  dataset  for  these  experiments, 
since  it  deals  with  abstracts  of  technical  documents 

•  Our  target  is  requirements  documents/abstracts). 
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Simple  Example 


•  Query  226  from  CRANFIELD: 

•  “how  should  the  navier-stokes  difference  equations  be 
solved” 


•  Relevant  documents  are  1063,  1078,  1080-1085,  a 
total  of  8  documents. 


•  A  pure  key-word  based  model  has  an  average 
precision  of  37%  on  this  model. 

•  A  pure  LDA  topic  model  using  10  Markov  chains  for 
sampling  gives  us  41% 

•  A  combination  of  both  gives  us  46%  precision. 

-  This  may  seem  low,  but  what  it  says  is  that  46%  of  the 
results  are  relevant  to  the  user. 


•  All  of  the  relevant  documents  are  retrieved  by  the 
22nd  document. 
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Two  of  the  Documents  found 


A  typical  keyword  result 


A  typical  LDA  result 


ESSE  Extendable  Semantic  Search  Engine 

4  C  S?  +  ^hnp://localhosi:8080/5emantic5earch/docview2.jsp?doc©  *  Q,-  class  file  has  wrong  vQ 

(T1  Information  .  .rieval  Links  Apple  Yahoo!  Google  Maps  YouTube  Wikipedia  'Jews  (6-761) ▼  Popular  ▼  » 

Instan...  Reque  Doubl  |0  CranD  |0  LDASe  Servle ...  ESSE-  Java;  Git  Us..  » 

1  match  <  ►  Ct  solve  Dane 

Retrieved  Document  1082 

Title:  the  flow  past  pitot  tube  at  low  reynolds  numbers,  part  1-dash 
the  numerical  solution  of  the  navier-stokes  equations  for  steady 
viscous  axisymmetric  flow,  part  2-dash  the  effects  of  viscosity  and 
orifice  size  on  a  pitot  tube  at  low  reynolds  numbers  . 


ESSE  -  Extendable  Semantic  Search  Engine 
4  |  ►  C  R:i  +  f  ■'http://localhost:8080/Semantic5earch/  *  Or  Google 


CP  Information  rieval  Links  Apple  Yahoo!  Google  Maps  YouTube 

Wikipedia  » 

Q  Instance.  . 

\Q  Request...  O  Double..  C  CranDoc ...  Q  LDASe  ar. 

0  ScrvlelR  » 

Retrieved  Document  1083 


Author(s):  tester,  w.g.s. 

Bibliographic  entry:  o.u.e.l.  no.  136,  a.r.c.  22,  070,  f.m.  2983  . 


Title:  an  investigation  of  fluid  flow  in  two 
dimensions  . 


the  flow  past  pitot  tube  at  low  reynolds  numbers,  part  1-dash  the 
numerical  solution  of  the  navier-stokes  equations  for  steady  viscous 
axisymmetric  flow,  part  2-dash  the  effects  of  viscosity  and  orifice  size 
on  a  pitot  tube  at  low  reynolds  numbers  .  in  this  report  numerical 
methods  used  to  solve  the  navier-stokes  equations  for  steady  viscous 
two-dimensional  flow  are  extended  to  include  the  case  of  axial 
symmetry  .  the  equations  and  their  finite  difference  approximations  are 
derived  working  in  cylindrical  polar  co-ordinates  with  the  stokes' 
stream  function  and  the  vorticity  as  variables  .  a  new  method  of 
dealing  with  the  boundary  conditions  is  given  .  the  effects  of  viscosity 
and  orifice  size  on  a  blunt-nosed  pitot  tube  have  been  theoretically 
investigated  up  to  a  reynolds  number  of  ten,  where  the  reynolds 
number  has  been  based  on  the  radius  of  the  tube  .  results  are 
expressed  in  terms  of  a  pressure  coefficient  where  p  is  the  pressure 
measured  in  the  tube,  p  the  density  of  the  fluid,  and  p  and  u  the  static 
pressure  and  velocity  in  an  undisturbed  flow  at  the  position  of  the  tube 
.  the  values  of  c  for  a  blunt-nosed  tube  are  found  to  be  less  than  those 
for  tubes  with  hemispheroidal  heads,  but  always  greater  than  unity  in 
the  range  considered  .  the  effect  of  the  orifice  size  is  to  decrease  c  as 
the  orifice  si2e  increases,  this  decrease  is  very  small  but  increases  with 
the  reynolds  number  .  at  a  reynolds  number  of  ten  the  decrease  is  at 
most  five  per  cent  of  the  value  of  c  when  there  is  no  orifice  .  it  is 
suggested  that  the  decrease  of  c  below  unity  found  in  some 
experimental  investigations  at  a  higher  reynolds  number  could  be  due 
to  the  effects  of  orifice  size  . 


Author(s):  thorn,  a. 

Bibliographic  entry:  a.r.c.,  r  +  m,  1194.  1928  . 

an  investigation  of  fluid  flow  in  two  dimensions  .  flow 
of  an  inviscid  fluid  .  -dash  there  are  in  existence 
several  methods  of  obtaining  numerical  solutions  to 
the  two-dimensional  flow  of  a  perfect  fluid  for  given 
boundary  conditions  .  part  2  of  the  present  paper  gives 
a  method  of  obtaining  a  numerical  solution  for  viscous 
steady  flow  solution  of  the  simpler  problem, 
illustrating  it  with  examples  bearing  on  the 
experimental  work  described  in  part  iv  . 
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A  Closer  Look 


Document  1082  (found  by 
keyword  search) 


Document  1083  (not  found  by 
keyword  search,  but  by  LDA) 


•  "...  report  numerical 
methods  used  to  solve  the 
navier-stokes  equations  for 
steady  viscous  two- 
dimensional  flow  are 
extended  ...” 

•  This  document  is  ranked  high 
in  a  keyword  search  model. 


•  "...  numerical  solutions  to 
the  two-dimensional  flow  of 
a  perfect  fluid  for  given 
boundary  conditions  ...” 

•  This  document  has  very  low 
rank  in  a  keyword  search,  but 
is  very  relevant  to  our  query. 

•  These  words  are  in  the  same 
“topic”  as  the  query,  even  if 
they  don’t  match. 


Remember  the  query:  "how  should  the  navier-stokes 
difference  equations  be  solved" 
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The  Power  of  Combination 
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Modular  Semantic  Search  Engine 
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Thank  You 
Questions? 


Craig  Martell 
cmartell@nps.edu 
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